USAAR-CHRONOS: Crawling the Web for Temporal Annotations
نویسندگان
چکیده
This paper describes the USAAR-CHRONOS participation in the Diachronic Text Evaluation task of SemEval-2015 to identify the time period of historical text snippets. We adapt a web crawler to retrieve the original source of the text snippets and determine the publication year of the retrieved texts from their URLs. We report a precision score of >90% in identifying the text epoch. Additionally, by crawling and cleaning the website that hosts the source of the text snippets, we present Daikon, a corpus that can be used for future work on epoch identification from a diachronic perspective.
منابع مشابه
CHRONOS: A Reasoning Engine for Qualitative Temporal Information in OWL
We propose CHRONOS, a system for reasoning over temporal information in OWL ontologies. Representing both qualitative temporal (i.e., information whose temporal extents are unknown such as “before”, “after” for temporal relations) in addition to quantitative information (i.e., where temporal information is defined precisely e.g., using dates) is a distinctive feature of the proposed approach. Q...
متن کاملPrioritize the ordering of URL queue in Focused crawler
The enormous growth of the World Wide Web in recent years has made it necessary to perform resource discovery efficiently. For a crawler it is not an simple task to download the domain specific web pages. This unfocused approach often shows undesired results. Therefore, several new ideas have been proposed, among them a key technique is focused crawling which is able to crawl particular topical...
متن کاملCHRONOS: A Tool for Handling Temporal Ontologies in Protégé
Representing information evolving in time in ontologies is a difficult problem to deal with. Temporal relations are in fact ternary (i.e., properties of objects that change in time involve also a temporal value in addition to the object and the subject) and cannot be handled directly by OWL. The standard solution to this problem is to introduce new (intermediate) classes into the ontology and m...
متن کاملCHRONOS Ed: A Tool for Handling Temporal Ontologies in Protégé
Representing information evolving in time in ontologies is a difficult problem to deal with. Temporal relations are in fact ternary (i.e., properties of objects that change in time involve also a temporal value in addition to the object and the subject) and cannot be handled directly by OWL. The standard solution to this problem is to map all temporal relations to a set of binary ones with new ...
متن کاملτOWL: A Framework for Managing Temporal Semantic Web Documents
The World Wide Web Consortium (W3C) OWL 2 Web Ontology Language (OWL 2) recommendation is an ontology language for the Semantic Web. It allows defining both schema (i.e., entities, axioms, and expressions) and instances (i.e., individuals) of ontologies. OWL 2 ontologies are stored as Semantic Web documents. However, OWL 2 lacks explicit support for time-varying schema or for time-varying insta...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015